ABSTRACT. The use of rotationally symmetric operators in vision is reviewed, and conditions for
rotational symmetry are derived for linear and quadratic forms in the first and second partial directional
derivatives of a function f(x, y). Surface interpolation is considered to be the process of computing the most
conservative solution consistent with boundary conditions. The "most conservative" solution is modelled using
the calculus of variations to find the minimum function that satisfies a given performance index. To guarantee
the existence of a minimum function, Grimson has recently suggested that the performance index should be
a seminorm. It is shown that all quadratic forms in the second partial derivatives of the surface satisfy this
criterion. The seminorms that are, in addition, rotationally symmetric form a vector space whose basis is the
square Laplacian and the quadratic variation. Whereas both seminorms give rise to the same Euler condition
in the interior, the quadratic variation offers the tighter constraint at the boundary and is to be preferred for
surface interpolation.
1. Introduction


Two separate themes from the Computer Vision literature come together in this paper: the use of
rotationally symmetric operators, and the idea that several modules of visual perception require that the "most
conservative" solution that meets a given set of boundary conditions be computed. The two themes are
combined in an investigation of which operator to use in the interpolation of smooth surfaces from
one-dimensional boundary constraints. Such constraints arise naturally in a variety of visual problems.

In the next section we review the role of rotationally symmetric operators in Computer Vision, and we 
derive conditions which linear and quadratic forms in the first and second directional derivatives must satisfy 
in order to be rotationally symmetric. We then discuss the idea that vision is a conservative process, citing 
examples from both figure perception and scene analysis. The "most conservative" solution is modelled using 
the calculus of variations to find the minimum function that satisfies a given performance index. A major 
problem associated with the use of the calculus of variations is guaranteeing the existence of a minimum
function (see for example Courant and Hilbert 1953, p.173). A theorem of Grimson(1981, theorem 2) proves 
that a sufficient condition for the existence of a minimum is that the performance index should be a seminorm 
on the space of functions. The condition is not necessary. For example, Horn(1981) has determined the curve 
that minimizes the integral square curvature subject to tangency conditions at the end points; the performance 
index is not a seminorm. 


Grimson(1981) notes that many intuitively plausible performance indices based on mean and Gaussian
curvature are not seminorms, but that the square Laplacian $f_{xx}^2 + 2 f_{xx} f_{yy} + f_{yy}^2$ and the quadratic variation
$f_{xx}^2 + 2 f_{xy}^2 + f_{yy}^2$ are. We show here that any quadratic form in $f_{xx}$, $f_{xy}$, and $f_{yy}$ is a seminorm.


To further constrain the choice of performance index in the infinite set of quadratic forms, we require 
in addition that the quadratic form should be rotationally symmetric. We prove that there are essentially two 
different choices: the square Laplacian and the quadratic variation. All the remaining possibilities are linear 
combinations, that is, form a vector space with these two as a basis. 

To choose between the square Laplacian and the quadratic variation, we consider their respective Euler
conditions and natural boundary conditions (Courant and Hilbert, 1953). The Euler conditions are identical,
but the natural boundary conditions, which are derived from the statics of a deformed thin plate, favor the
quadratic variation since they offer tighter constraint in this case.

2. Rotationally symmetric operators in vision 

A major concern of Computer Vision is the isolation of constraints that combine with the information 
provided in the image to yield an interpretation. Early work on polyhedra (Clowes 1971, Huffman 1971, 
Mackworth 1973, Waltz 1972, Sugihara 1978,1981, Kanade 1981) focussed upon the discovery of constraints 
deriving from the image forming process, constraints that relate image fragments, like junctions and lines, to 
their scene counterparts, vertices and edges. As Computer Vision turned its attention away from plane-faced 
objects to the natural world, other constraints were required. Often the constraints expressed some facet of the 
intuitive notion of "smoothness" and did so in a way that supported useful computations (Strat 1979, Brooks
1979, Ikeuchi and Horn 1981, Woodham 1978, Horn and Schunck 1981). Recently, smoothness and image 
forming have been combined using differential geometry (Grimson 1981, Witkin 1981, Binford 1981). 

One constraint that is usually implicit, but is occasionally made explicit, expresses the idea that perceptual 
processes are often approximately isotropic. It seems that humans usually do not show strong directional 
preferences when detecting edges, motion, or reflectance boundaries. We seem to be equally adept at
perceiving the layout and orientation of a visible surface regardless of its orientation relative to the view vector.
Ullman(1976) argues for an explicit isotropy constraint in his work on subjective contours (see also Knuth
1979).

Processes that are isotropic are naturally computed by rotationally symmetric operators, since the values 
they return are unaffected by the coordinate system chosen for the image. Conversely, rotationally symmetric
operators compute isotropic information. As we shall see, many operators that have been proposed for vision 
are not rotationally symmetric but directionally selective. Some authors have, however, proposed rotationally 
symmetric operators, particularly for early visual processing. 





Precise definitions of rotational symmetry for functions, operators (or functionals), and, by specialization, 
matrices are given in the following section. In the rest of this section we assume that the definitions are already 
understood. 

Some kinds of blurring in an image forming system can be approximated by convolution with a 
Gaussian. The rotationally symmetric Gaussian can be defined by: 

\[ G(r) = \frac{1}{2\pi\sigma^2} \exp\left(-\frac{r^2}{2\sigma^2}\right). \]

Pratt(1978) presents several techniques, such as convolution with the generalized inverse of the blur
function, for restoring the image (see, for example, his figures 14.2.1 and 14.3.2).

The Laplacian $\Delta f = f_{xx} + f_{yy}$ is well known to be rotationally symmetric†, and its use has been proposed
several times in Computer Vision and Image Processing. If an image is blurred in a way that can be
approximately modelled by passing the image through a system with a Gaussian point spread function, then it
can be sharpened by subtracting a multiple of its Laplacian (Rosenfeld and Kak 1976, p. 184, Prewitt 1970,
p. 107). Pratt(1978, figure 17.4.5) illustrates the use of the Laplacian for enhancing the edges in an image.

Weska, Dyer and Rosenfeld(1976) note that convolving a step edge with a Laplacian operator gives rise to 
a pulse pair: a negative pulse at the transition from the lower plateau to the edge, and a positive pulse at the 
transition from the edge to the upper plateau (see also Horn 1974, Marr and Hildreth 1980). They suggested 
that the image intensities at the locations of the positive and negative pulses could be used to set thresholds to 
use in segmenting the image into regions. 

Several authors have noted the relative insensitivity of human perception to small intensity gradients 
(Herskovits and Binford 1970, Marr 1976, Marr and Hildreth 1980, McCann et al. 1974). They have noted
that the effect can be explained by assuming that the vision system uses operators approximating second 
derivatives. This so-called lateral inhibition effect seems to be performed by center surround operators in 
the retina (see for example Richter and Ullman 1980). The Laplacian is a rotationally symmetric second
differential operator, and an attractive candidate to perform lateral inhibition.

† A proof of this is given in Section 3 below.

The use of the Laplacian for edge detection was proposed by Horn(1974) in a study of the determination
of lightness. Following Land and McCann(1971), Horn restricted attention to images of planes colored with 
patches of uniform reflectance or color. Within a patch, grey level variations are due to small variations 
in illumination, and they are smooth compared to the abrupt changes between patches. The conventional 
approach to detecting significant changes in intensity had been to note that the gradient of the image is 
small within a region, but is infinite across a reflectance boundary between regions. For a particular image 
tessellation and quantization of grey levels, the gradient is always finite. It is usually much larger, however, at
a reflectance boundary than it is within a region. Horn(1974) rejected using the gradient since "the first partial 
derivatives are directional and thus unsuitable since they will for example completely eliminate evidence of 
edges running in a direction parallel to their direction of differentiation." The Laplacian is the lowest order 
linear combination of derivatives that is rotationally symmetric. A reflectance boundary can be detected by the 
paired positive and negative peaks on either side of the boundary, and localized by noting the position where 
the Laplacian crosses zero between the peaks†.

Marr and Hildreth(1980) have proposed that edges are detected in the human visual system by an 
operator that approximates AG, where A is the Laplacian, and G is a rotationally symmetric Gaussian. We 
shall show in the next section that the application of a rotationally symmetric operator, such as the Laplacian, 
to a rotationally symmetric function, such as the Gaussian, is itself rotationally symmetric. It follows that the 
Marr-Hildreth operator is rotationally symmetric. Marr and Hildreth note that intensity changes occur at a 
number of scales and are often superimposed. They suggest that an image should be smoothed by a number 
of bandpass filters to isolate the changes at a particular range of scales. The Gaussian is chosen as the filter to 
optimize localization of changes in both the spatial and frequency domains. 

We noted above that the Gaussian and the Laplacian have figured prominently in early visual processing.
The Gaussian has mostly been used to approximate the point spread function corresponding to the blurring of
a point source; Marr and Hildreth deliberately introduce Gaussian blurring. They further note that $\Delta G$ can be
approximated by a difference of Gaussians, $G_1 - G_2$. Nishihara and Larson(1981) note that the difference of
Gaussians is to be preferred on grounds of efficiency. Macleod(1972) proposes an edge detection operator that
is the difference of two Gaussians. However, no analysis of its performance is given, and no indication is given
that the operator approximates a low-pass filtered second derivative.

† See Binford (1981) for more on the distinction between detection and localization of an intensity change.
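The rotational symmetry of these filters can be checked numerically. The sketch below is illustrative only: it assumes the normalization $G(r) = \exp(-r^2/2\sigma^2)/(2\pi\sigma^2)$, uses the closed form of the Laplacian of that Gaussian, and the function names and sample values are hypothetical choices, not taken from Marr and Hildreth.

```python
import math

def gaussian(r, sigma):
    """Rotationally symmetric 2-D Gaussian G(r) (assumed normalization)."""
    return math.exp(-r * r / (2.0 * sigma ** 2)) / (2.0 * math.pi * sigma ** 2)

def laplacian_of_gaussian(r, sigma):
    """Closed form of (d^2/dx^2 + d^2/dy^2) applied to G, as a function of r."""
    return (r * r - 2.0 * sigma ** 2) / sigma ** 4 * gaussian(r, sigma)

def difference_of_gaussians(r, sigma_1, sigma_2):
    """G_1 - G_2 for two Gaussians of different widths."""
    return gaussian(r, sigma_1) - gaussian(r, sigma_2)

def rotate(x, y, phi):
    """Coordinates (X, Y) of the point (x, y) in a frame rotated through phi."""
    c, s = math.cos(phi), math.sin(phi)
    return c * x + s * y, -s * x + c * y

# Evaluate both filters at a point and at the same point in a rotated frame.
x, y, sigma = 1.3, -0.4, 1.0
X, Y = rotate(x, y, 0.7)
log_before = laplacian_of_gaussian(math.hypot(x, y), sigma)
log_after = laplacian_of_gaussian(math.hypot(X, Y), sigma)
dog_before = difference_of_gaussians(math.hypot(x, y), 1.6, 1.0)
dog_after = difference_of_gaussians(math.hypot(X, Y), 1.6, 1.0)
```

Both filters depend on the image point only through its radial distance, so their values are unchanged by the rotation.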

Regarding the use of the Laplacian, Marr and Hildreth do not seem to make isotropy an explicit
constraint on edge detection. Instead, Hildreth(1980, page 13) notes that "a number of practical considerations,
which will be illuminated in the discussion of the implementation, suggested that the ... operators not be 
directional". Suppose instead that directional operators are used. The simplest algorithm for edge detection 
has two stages. First, the image is convolved with the directional operators in "sufficiently many" directions. 
Second, the outputs are combined to determine the orientation and extent of intensity changes. Regarding 
the first stage, both Marr and Hildreth(1980, page 193) and Hildreth(1980, page 40) claim that the cost of 
convolving the image with a "sufficient" number of operators is excessive. They show that a single rotationally 
symmetric operator (the Laplacian) gives precisely the same results if a condition called "linear variation" 
holds. Regarding the second stage, Hildreth(1980, page 36) observes that edges in a direction close to that of 
the mask are elongated in the direction of the mask. She also notes that operators at several orientations give 
significant responses to any given edge, and that combining the responses is non-trivial. 

There are two essentially different issues here that need to be clearly separated. Intensity changes first 
have to be detected and then localized as a set of "feature points" marking the position of the change in the 
image, and characteristics of the corresponding edge. The detection of feature points is inherently isotropic, 
as Horn(1974) noted. The feature points have then to be combined to produce descriptions of edge segments.
Edge segments are clearly directional, indeed a central problem concerns the determination of the direction of 
an edge in an image. The computation of rich descriptions of edge segments is, as Hildreth notes, not at all 
easy. Marr's(1976) original Primal Sketch work was almost entirely concerned with it. Binford(1981) discusses
the application of directional operators to compute the directionality of an edge.








The Gaussian and Laplacian are not the only rotationally symmetric operators that have been proposed
in computer vision. Prewitt(1970, p. 107) observes that "derivatives of all orders can be used to form isotropic
nonlinear differential operators, provided that derivatives of odd order appear only in even functions. The
simplest of these... is the squared gradient", namely $\nabla \cdot \nabla$, where $\nabla$ is the column vector

\[ \nabla = \begin{bmatrix} \partial/\partial x \\ \partial/\partial y \end{bmatrix}. \]

Earlier in the same article, Prewitt(1970, p. 85) suggests that "the Hankel transformation enters naturally
in the analysis of systems with isotropic point spread functions and greatly facilitates restoration." The
suggestion does not appear to have been investigated in computer vision.

We noted earlier that an important aspect of modelling perception is the isolation of constraints which
capture some facet of smoothness. Horn and Schunck(1981) consider the determination of optical flow fields
and note that "if every point of the brightness pattern can move independently, there is little hope of
recovering the velocities". One way to express the additional constraint of smoothness is to minimize the integral of
the performance index

\[ S(u, v) = (u_x^2 + u_y^2) + (v_x^2 + v_y^2), \]

where u and v are the x and y components of the optical flow, and subscripts denote partial differentiation.
We shall show in the next section that this operator is rotationally symmetric. In many simple situations the
smoothness constraint is significantly wrong only at occluding boundaries.

We conclude this review of the use of rotationally symmetric operators in vision with Grimson's(1981)
work on surface interpolation. As it will be the focus of Section 5, our remarks will be brief. The Marr-Poggio
theory of human stereo vision yields the disparity (scaled depth) at matched edge points that are computed
by the Marr-Hildreth approach described above. The disparity map is as sparse as the set of matched edge
points, whereas human perception is of smooth surfaces passing through the given disparity points. Grimson
(1981) interpolates a smooth surface from the given set of edge points by a local parallel algorithm that applies
a rotationally symmetric operator to minimize the quadratic variation introduced above.


3. Conditions for rotational symmetry 


A function $f: \mathbb{R}^2 \to \mathbb{R}$ is rotationally symmetric if its polar form is only dependent on radial distance
$r = (x^2 + y^2)^{1/2}$ and not on direction $\phi = \tan^{-1}(y/x)$. Clearly, a function is rotationally symmetric if and only
if it can be represented as a function of $(x^2 + y^2)$. An alternative definition can be given that is often more
convenient for functions, and that can be generalized to operators. A function is rotationally symmetric if and
only if it yields the same value under an arbitrary rotation of coordinates.

An anticlockwise rotation from one set of image coordinates (x, y) to another (X, Y) is effected by a
rotation matrix:

\[ \begin{bmatrix} X \\ Y \end{bmatrix} = \begin{bmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix}. \tag{0} \]

For convenience, we shall denote $\cos\phi$ by c and $\sin\phi$ by s. To simplify notation, we shall not make
explicit the dependence of the rotation matrix R on the angle $\phi$. A function f is rotationally symmetric if and
only if the untransformed version f(x, y) is equal to the transformed version f(X, Y). We shall occasionally
find it useful to borrow the mathematical shorthand that equates a function f(X, Y) with a function of a single
vector argument $f(R[x, y]^T)$.

Example 1. The function $f_1(x, y) = (x^2 + y^2)$ is rotationally symmetric:

\[ f_1(X, Y) = (xc + ys)^2 + (yc - xs)^2 = (x^2 + y^2) = f_1(x, y). \]

Example 2. The function $f_2(x, y) = xy$ is not rotationally symmetric:

\[ f_2(X, Y) = (xc + ys)(yc - xs) = xy \cos 2\phi + \frac{y^2 - x^2}{2} \sin 2\phi, \]

and so $f_2(X, Y) = f_2(x, y)$ only when $\phi = 0$ or $\phi = \pi$.
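Examples 1 and 2 can be checked numerically. The following is a minimal sketch, assuming the rotation convention X = xc + ys, Y = yc - xs used in Example 1; the helper name `is_rotationally_symmetric`, the sample points, and the sampled angles are illustrative choices only, and a finite sample can refute symmetry but not prove it.

```python
import math

def rotate(x, y, phi):
    """Coordinates (X, Y) of (x, y) after rotating the frame through phi."""
    c, s = math.cos(phi), math.sin(phi)
    return x * c + y * s, y * c - x * s

def f1(x, y):
    return x * x + y * y      # Example 1

def f2(x, y):
    return x * y              # Example 2

def is_rotationally_symmetric(f, samples, angles, tol=1e-9):
    """True if f(X, Y) == f(x, y) at every sampled point and angle."""
    for x, y in samples:
        for phi in angles:
            X, Y = rotate(x, y, phi)
            if abs(f(X, Y) - f(x, y)) > tol:
                return False
    return True

samples = [(1.0, 0.5), (-2.0, 3.0), (0.3, -0.7)]
angles = [0.1 * k for k in range(1, 63)]   # roughly one full turn
f1_symmetric = is_rotationally_symmetric(f1, samples, angles)
f2_symmetric = is_rotationally_symmetric(f2, samples, angles)
```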

We can extend the definition of rotational symmetry to operators

\[ O: (\mathbb{R}^2 \to \mathbb{R}) \to (\mathbb{R}^2 \to \mathbb{R}). \]

An operator O is rotationally symmetric if O(f) is a rotationally symmetric function, for all functions
$f: \mathbb{R}^2 \to \mathbb{R}$.

Example 3. The function produced by the operator $O_1$, defined by

\[ O_1(f)(x, y) = e^{f(x, y)}, \]

is rotationally symmetric if and only if f is. In general then, the operator $O_1$ is not rotationally symmetric.
However, the Gaussian is a rotationally symmetric operator as it combines examples 1 and 3.

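Most of the operators of interest in computer vision are combinations of the first and second directional
derivatives $\partial/\partial x$, $\partial/\partial y$, $\partial^2/\partial x^2$, $\partial^2/\partial x \partial y$, and $\partial^2/\partial y^2$. We need to determine the effect of a coordinate rotation on these
directional derivatives. By the chain rule,

\[ \frac{\partial}{\partial x} = \frac{\partial X}{\partial x}\frac{\partial}{\partial X} + \frac{\partial Y}{\partial x}\frac{\partial}{\partial Y} = c\,\frac{\partial}{\partial X} - s\,\frac{\partial}{\partial Y}. \]

Similarly,

\[ \frac{\partial}{\partial y} = s\,\frac{\partial}{\partial X} + c\,\frac{\partial}{\partial Y}. \]

It follows that

\[ \begin{bmatrix} \partial/\partial x \\ \partial/\partial y \end{bmatrix} = R^T \begin{bmatrix} \partial/\partial X \\ \partial/\partial Y \end{bmatrix}, \tag{1} \]

where T denotes matrix transpose. Since R is a rotation matrix, its transpose equals its inverse, so

\[ \begin{bmatrix} \partial/\partial X \\ \partial/\partial Y \end{bmatrix} = R \begin{bmatrix} \partial/\partial x \\ \partial/\partial y \end{bmatrix}. \tag{2} \]

Operators in general, and differential operators in particular, depend upon the choice of coordinate
frame. To make the dependence of the differential operator on the choice of coordinate frame explicit, we
introduce the notation $O_{(x,y)}$ for an operator O expressed in the coordinate frame (x, y).

With this notation, equation (1) becomes

\[ \nabla_{(x,y)} = R^T \nabla_{(X,Y)}, \]

where $\nabla_{(x,y)}$ is the column vector

\[ \nabla_{(x,y)} = \begin{bmatrix} \partial/\partial x \\ \partial/\partial y \end{bmatrix}. \]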

Proposition 1. Linear combinations of $\partial/\partial x$ and $\partial/\partial y$ are not rotationally symmetric.

Proof. Any linear form in the first directional derivatives has the form

\[ [\lambda \ \ \mu]\, \nabla_{(x,y)}. \]

The condition for rotational symmetry is

\[ [\lambda \ \ \mu]\, \nabla_{(X,Y)} = [\lambda \ \ \mu]\, \nabla_{(x,y)}. \]

By equation (2),

\[ [\lambda \ \ \mu]\, \nabla_{(X,Y)} = [\lambda \ \ \mu]\, R\, \nabla_{(x,y)}, \]

and so the linear differential operator is rotationally symmetric if and only if

\[ [\lambda \ \ \mu] = [\lambda \ \ \mu]\, R, \]

so that $[\lambda \ \ \mu]$ is an eigenvector of R. The eigenvalues of R are $c + is$ and $c - is$. So there are no real
eigenvectors unless $\phi$ is a multiple of $\pi$. Since the condition is not satisfied for all $\phi$, no linear combination is
rotationally symmetric. ∎

The same style of analysis can be applied to other combinations of first derivatives such as the operator 


Hf) 


U + fv 


It is easy to show that 0 2 (x,y) is not equal to 0 2 ( z , y ), for example when <j> — f. 
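In section 2, we referred to an operator proposed by Prewitt(1970), namely

\[ \left(\frac{\partial}{\partial x}\right)^2 + \left(\frac{\partial}{\partial y}\right)^2, \]

that is, the vector dot product

\[ \nabla_{(x,y)} \cdot \nabla_{(x,y)}. \]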
More generally, we often consider quadratic differential expressions such as

\[ \nabla_{(x,y)}^T \begin{bmatrix} \lambda & \mu \\ \nu & \xi \end{bmatrix} \nabla_{(x,y)}. \]

Such an expression is called a quadratic form if the matrix is symmetric, that is $\mu = \nu$. By equation (1),

\[ \nabla_{(X,Y)} = R\, \nabla_{(x,y)}, \]

so that

\[ \nabla_{(X,Y)}^T M\, \nabla_{(X,Y)} = \nabla_{(x,y)}^T M\, \nabla_{(x,y)} \]

if and only if

\[ R^T M R = M, \]

where R is an arbitrary rotation matrix, and

\[ M = \begin{bmatrix} \lambda & \mu \\ \nu & \xi \end{bmatrix}. \]

Since the transpose $R^T$ of a rotation matrix R is the inverse of R, a quadratic form is rotationally
symmetric if and only if the corresponding matrix M commutes with all rotation matrices. We will refer to
matrices M having this property as being rotationally symmetric.

Lemma 1. A 2 by 2 matrix is rotationally symmetric if and only if it has the form

\[ \begin{bmatrix} \lambda & \mu \\ -\mu & \lambda \end{bmatrix}. \]

Proof. We require RM = MR for all rotation matrices R, that is

\[ \begin{bmatrix} c & -s \\ s & c \end{bmatrix} \begin{bmatrix} \lambda & \mu \\ \nu & \xi \end{bmatrix} = \begin{bmatrix} \lambda & \mu \\ \nu & \xi \end{bmatrix} \begin{bmatrix} c & -s \\ s & c \end{bmatrix}. \]

Expanding, and equating terms, this holds if and only if

\[ \mu + \nu = 0, \qquad \lambda = \xi. \]

Alternatively, only the operations of scaling by a constant k and multiplication by a rotation matrix R'
commute with all rotation matrices in two dimensions. So M = kR' for some scale factor k and some rotation
matrix R'. ∎
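Lemma 1 is easy to check numerically. The sketch below is illustrative only: it tests whether a 2 by 2 matrix commutes with rotation matrices at a few sampled angles, which can refute rotational symmetry but is not a proof of it. The helper names and the sample matrices are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def rotation(phi):
    """Rotation matrix R of equation (0)."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s], [-s, c]]

def commutes_with_rotations(m, angles, tol=1e-12):
    """True if RM == MR for every sampled rotation angle."""
    for phi in angles:
        r = rotation(phi)
        rm, mr = matmul(r, m), matmul(m, r)
        if any(abs(rm[i][j] - mr[i][j]) > tol
               for i in range(2) for j in range(2)):
            return False
    return True

angles = [0.3, 1.1, 2.5]
lemma_form = [[2.0, 0.7], [-0.7, 2.0]]   # lambda = xi, mu + nu = 0
generic = [[2.0, 0.7], [0.7, 1.0]]       # symmetric, but lambda != xi
lemma_form_commutes = commutes_with_rotations(lemma_form, angles)
generic_commutes = commutes_with_rotations(generic, angles)
```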

Proposition 2. Up to scaling, the only rotationally symmetric quadratic form in $\partial/\partial x$ and $\partial/\partial y$ is $\nabla_{(x,y)} \cdot \nabla_{(x,y)}$.

Proof. A quadratic form in $\partial/\partial x$ and $\partial/\partial y$ has the form

\[ \nabla_{(x,y)}^T \begin{bmatrix} \lambda & \mu \\ \mu & \xi \end{bmatrix} \nabla_{(x,y)}. \tag{3} \]

To be rotationally symmetric, as well as symmetric (so that it is a quadratic form), Lemma 1 implies that

\[ \lambda = \xi, \qquad \mu = 0. \]

It follows that the matrix in equation (3) is $\lambda I_2$. ∎

The operator $f_x^2 + f_y^2$ is commonly used as a measure of the contrast across an intensity change. Notice
that other popular measures of the contrast, such as $(f_x + f_y)^2$, $(f_x - f_y)^2$, or $|f_x| + |f_y|$ are not rotationally
symmetric, and therefore respond differently to edges in different directions (see Rosenfeld and Kak 1976,
p. 279).
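The directional bias of the non-symmetric contrast measures can be seen with a short experiment. This is a sketch under stated assumptions: a unit-magnitude gradient is expressed in frames rotated through sixteen evenly spaced angles, and the spread (maximum minus minimum) of each measure is recorded. The dictionary keys and the helper `spread` are illustrative names only.

```python
import math

def rotate_gradient(fx, fy, phi):
    """Gradient components in a frame rotated through phi (multiplication by R)."""
    c, s = math.cos(phi), math.sin(phi)
    return c * fx + s * fy, -s * fx + c * fy

measures = {
    "fx^2 + fy^2": lambda fx, fy: fx * fx + fy * fy,
    "(fx + fy)^2": lambda fx, fy: (fx + fy) ** 2,
    "(fx - fy)^2": lambda fx, fy: (fx - fy) ** 2,
    "|fx| + |fy|": lambda fx, fy: abs(fx) + abs(fy),
}

def spread(measure, fx, fy, angles):
    """Variation of a contrast measure as the coordinate frame is rotated."""
    values = [measure(*rotate_gradient(fx, fy, phi)) for phi in angles]
    return max(values) - min(values)

angles = [k * math.pi / 8 for k in range(16)]
spreads = {name: spread(m, 1.0, 0.0, angles) for name, m in measures.items()}
```

Only the squared gradient magnitude has zero spread; the other three measures report different contrasts for the same edge depending on its direction.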

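We now consider linear and quadratic forms in $\partial^2/\partial x^2$, $\partial^2/\partial x \partial y$, $\partial^2/\partial y \partial x$, and $\partial^2/\partial y^2$. It is convenient to not assume
$\partial^2/\partial x \partial y = \partial^2/\partial y \partial x$ for the developments that follow.

The first task is to find a matrix $R^*$ so that

\[ \begin{bmatrix} \partial^2/\partial x^2 \\ \partial^2/\partial x \partial y \\ \partial^2/\partial y \partial x \\ \partial^2/\partial y^2 \end{bmatrix} = R^* \begin{bmatrix} \partial^2/\partial X^2 \\ \partial^2/\partial X \partial Y \\ \partial^2/\partial Y \partial X \\ \partial^2/\partial Y^2 \end{bmatrix}. \tag{4} \]

The (i, j) element of the matrix R† will be denoted by $r_{ij}$. Applying the chain rule as before, but this
time to relate the second derivatives in (X, Y) to those in (x, y), we find that the four by four matrix $R^*$ can be
written in the form

\[ R^* = \begin{bmatrix} r_{11} R^T & r_{21} R^T \\ r_{12} R^T & r_{22} R^T \end{bmatrix}. \tag{5} \]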

Definition 1. (Ben-Israel and Greville 1974, page 41) Let $A = [a_{ij}]$ and $B = [b_{ij}]$ be m by m and n by n
matrices respectively. The mn by mn matrix $A \otimes B$, called the Kronecker product of A and B, is defined by
multiplying each element $a_{ij}$ of A by the matrix B, to form the block matrix

\[ A \otimes B = \begin{bmatrix} a_{11}B & a_{12}B & \cdots & a_{1m}B \\ a_{21}B & a_{22}B & \cdots & a_{2m}B \\ \vdots & \vdots & & \vdots \\ a_{m1}B & a_{m2}B & \cdots & a_{mm}B \end{bmatrix}. \]

With this notation,

\[ R^* = R^T \otimes R^T, \tag{6} \]

so that

\[ R^* = \begin{bmatrix} c^2 & -sc & -sc & s^2 \\ sc & c^2 & -s^2 & -sc \\ sc & -s^2 & c^2 & -sc \\ s^2 & sc & sc & c^2 \end{bmatrix}. \tag{7} \]

Note that the elements of $A \otimes B$ are naturally indexed by 4-tuples:

\[ \{A \otimes B\}_{ijkl} = a_{ij} b_{kl}. \]

† Recall the definition of the matrix R from equation (0).

We state without proof a number of simple properties of the $\otimes$ operation. They are essentially
straightforward consequences of the properties of ordinary multiplication.

Proposition 3.

(i) $(A \otimes B)^T = A^T \otimes B^T$.

(ii) $(A \otimes B)^{-1} = A^{-1} \otimes B^{-1}$.

(iii) $(A \otimes B) \otimes C = A \otimes (B \otimes C)$.

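For the remainder of the paper, we restrict attention to the application of $\otimes$ to R and its transpose.

Proposition 4. The rotationally symmetric linear combinations of $\partial^2/\partial x^2$, $\partial^2/\partial x \partial y$, $\partial^2/\partial y \partial x$, and $\partial^2/\partial y^2$ are linear
combinations of the Laplacian $\Delta = \partial^2/\partial x^2 + \partial^2/\partial y^2$ and the smoothness measure $\partial^2/\partial x \partial y - \partial^2/\partial y \partial x$.

Proof. Let the linear combination be

\[ [\lambda \ \ \mu \ \ \nu \ \ \xi] \begin{bmatrix} \partial^2/\partial x^2 \\ \partial^2/\partial x \partial y \\ \partial^2/\partial y \partial x \\ \partial^2/\partial y^2 \end{bmatrix}. \]

Following the proof of Proposition 1, the condition for rotational symmetry is

\[ [\lambda \ \ \mu \ \ \nu \ \ \xi]\, R^T \otimes R^T = [\lambda \ \ \mu \ \ \nu \ \ \xi] \]

for all rotation matrices R and the corresponding rotation angle $\phi$. Expanding $R^T \otimes R^T$ by equation (7), we
find

\[ [\lambda \ \ \mu \ \ \nu \ \ \xi] \begin{bmatrix} c^2 & -sc & -sc & s^2 \\ sc & c^2 & -s^2 & -sc \\ sc & -s^2 & c^2 & -sc \\ s^2 & sc & sc & c^2 \end{bmatrix} = [\lambda \ \ \mu \ \ \nu \ \ \xi], \]

so that

\[ [\lambda \ \ \mu \ \ \nu \ \ \xi] \begin{bmatrix} -s^2 & -sc & -sc & s^2 \\ sc & -s^2 & -s^2 & -sc \\ sc & -s^2 & -s^2 & -sc \\ s^2 & sc & sc & -s^2 \end{bmatrix} = [0 \ \ 0 \ \ 0 \ \ 0]. \]

It follows that

\[ [\lambda - \xi \ \ \ \mu + \nu \ \ \ 0 \ \ \ 0] \begin{bmatrix} -2s^2 & -2sc & -2sc & 2s^2 \\ 2sc & -2s^2 & -2s^2 & -2sc \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{bmatrix} = [0 \ \ 0 \ \ 0 \ \ 0]. \]

The determinant of the upper left 2 by 2 submatrix is

\[ (4s^4 + 4s^2c^2) = 4s^2. \]

Since this is not zero for all angles $\phi$, $\lambda - \xi$ and $\mu + \nu$ are both zero. A basis for the infinite set of
linear combinations satisfying these conditions is provided by setting $\lambda$ and $\mu$ equal to one, which proves the
Proposition. ∎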
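Proposition 4 can be illustrated numerically by applying the invariance condition of the proof to particular coefficient vectors. The sketch below assumes the coefficient ordering $[\lambda, \mu, \nu, \xi]$ over $(\partial_{xx}, \partial_{xy}, \partial_{yx}, \partial_{yy})$ used above; the helper names and sampled angles are illustrative, and sampling a few angles is a check rather than a proof.

```python
import math

def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [[a_ij * b_kl for a_ij in row_a for b_kl in row_b]
            for row_a in a for row_b in b]

def vec_mat(v, m):
    """Row vector times matrix."""
    return [sum(v[i] * m[i][j] for i in range(len(v))) for j in range(len(m[0]))]

def is_invariant(coeffs, angles, tol=1e-12):
    """True if coeffs (R^T kron R^T) == coeffs at every sampled angle."""
    for phi in angles:
        c, s = math.cos(phi), math.sin(phi)
        r_t = [[c, -s], [s, c]]               # transpose of R in equation (0)
        image = vec_mat(coeffs, kron(r_t, r_t))
        if any(abs(p - q) > tol for p, q in zip(image, coeffs)):
            return False
    return True

angles = [0.2, 0.9, 1.7, 3.0]
laplacian = [1.0, 0.0, 0.0, 1.0]              # lambda = xi = 1
mixed_difference = [0.0, 1.0, -1.0, 0.0]      # mu = 1, nu = -1
f_xx_alone = [1.0, 0.0, 0.0, 0.0]

laplacian_invariant = is_invariant(laplacian, angles)
mixed_invariant = is_invariant(mixed_difference, angles)
f_xx_invariant = is_invariant(f_xx_alone, angles)
```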

Before turning to quadratic forms, analogous to Proposition 2, we define a projection operator on
$R^T \otimes R^T$ that makes explicit the assumption $f_{xy} = f_{yx}$.

Definition 2. Let $D = [d_{ij}]$ be a 4 by 4 matrix. The projection of D is the 3 by 3 matrix $D^*$:

\[ D^* = \begin{bmatrix} d_{11} & (d_{12} + d_{13}) & d_{14} \\ (d_{21} + d_{31}) & (d_{22} + d_{32} + d_{23} + d_{33}) & (d_{24} + d_{34}) \\ d_{41} & (d_{42} + d_{43}) & d_{44} \end{bmatrix}. \]

That is, the second and third columns as well as the second and third rows are combined by addition.

Proposition 5.

\[ [a \ \ b \ \ b \ \ c]\, D\, [a \ \ b \ \ b \ \ c]^T \]

is equivalent to

\[ [a \ \ b \ \ c]\, D^*\, [a \ \ b \ \ c]^T, \]

where $D^*$ is the projection of D.

The proof is by equating terms, and is omitted. We now give the main result of this section.
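Since the proof of Proposition 5 is omitted, a small numerical check may be helpful. The sketch below implements the projection of Definition 2 and compares the two quadratic expressions for one arbitrary matrix and one arbitrary vector; the names `project` and `quadratic_form` and the sample values are illustrative only.

```python
def project(d):
    """Definition 2: combine rows 2, 3 and columns 2, 3 of a 4x4 matrix by addition."""
    groups = [[0], [1, 2], [3]]   # zero-based indices merged into each output index
    return [[sum(d[i][j] for i in gi for j in gj) for gj in groups]
            for gi in groups]

def quadratic_form(v, m):
    """Value of v m v^T for a row vector v."""
    n = len(v)
    return sum(v[i] * m[i][j] * v[j] for i in range(n) for j in range(n))

D = [[1, 2, 3, 4],
     [5, 6, 7, 8],
     [9, 10, 11, 12],
     [13, 14, 15, 16]]
a, b, c = 0.5, -1.5, 2.0

full = quadratic_form([a, b, b, c], D)            # 4-vector with repeated b
reduced = quadratic_form([a, b, c], project(D))   # 3-vector with projected matrix
```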

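Proposition 6. The rotationally symmetric quadratic forms in $\partial^2/\partial x^2$, $\partial^2/\partial x \partial y$, $\partial^2/\partial y \partial x$, and $\partial^2/\partial y^2$ form a vector
space. If $\partial^2/\partial x \partial y = \partial^2/\partial y \partial x$, the matrices associated with the rotationally symmetric quadratic forms project to 3 by
3 matrices of the form

\[ \begin{bmatrix} \alpha + \beta & 0 & \beta \\ 0 & 2\alpha & 0 \\ \beta & 0 & \alpha + \beta \end{bmatrix}. \]

It follows that the rotationally symmetric quadratic forms that satisfy $\partial^2/\partial x \partial y = \partial^2/\partial y \partial x$ form a vector space
that has the quadratic variation, $(\partial^2/\partial x^2)^2 + 2(\partial^2/\partial x \partial y)^2 + (\partial^2/\partial y^2)^2$, and the square Laplacian, $(\partial^2/\partial x^2 + \partial^2/\partial y^2)^2$, as a
basis.

Proof. Since the matrix in a quadratic form is defined to be symmetric, a quadratic form in $\partial^2/\partial x^2$, $\partial^2/\partial x \partial y$,
$\partial^2/\partial y \partial x$, and $\partial^2/\partial y^2$ can be written

\[ \left[\frac{\partial^2}{\partial x^2} \ \ \frac{\partial^2}{\partial x \partial y} \ \ \frac{\partial^2}{\partial y \partial x} \ \ \frac{\partial^2}{\partial y^2}\right] \begin{bmatrix} A & B \\ B^T & C \end{bmatrix} \left[\frac{\partial^2}{\partial x^2} \ \ \frac{\partial^2}{\partial x \partial y} \ \ \frac{\partial^2}{\partial y \partial x} \ \ \frac{\partial^2}{\partial y^2}\right]^T, \]

where $A = [a_{ij}]$, $B = [b_{ij}]$, and C are 2 by 2 matrices.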
Similarly, equation (8) or (9) when $\phi = \pi/2$ yields

\[ b_{11} + b_{22} = 0. \tag{13} \]

Expanding equation (8) for general $\phi$ yields

\[ b_{11} + a_{12} = 0, \tag{14} \]

\[ b_{22} - a_{21} = 0, \tag{15} \]

\[ b_{21} + b_{12} + a_{22} - a_{11} = 0. \tag{16} \]

Combining equations (12) through (16) we find that in order to be rotationally symmetric, the matrix

\[ \begin{bmatrix} A & B \\ B^T & C \end{bmatrix} \]

has the form

\[ \begin{bmatrix} \alpha + \beta & \gamma & -\gamma & \beta \\ \gamma & \alpha - \delta & \delta & \gamma \\ -\gamma & \delta & \alpha - \delta & -\gamma \\ \beta & \gamma & -\gamma & \alpha + \beta \end{bmatrix}. \]

A matrix of this form projects to

\[ \begin{bmatrix} \alpha + \beta & 0 & \beta \\ 0 & 2\alpha & 0 \\ \beta & 0 & \alpha + \beta \end{bmatrix}, \]

where $\alpha = a_{11} - b_{12}$ and $\beta = b_{12}$. It is easy to show that linear combinations of matrices of this form are
of the same form, so that the rotationally symmetric quadratic forms constitute a vector space. Clearly, the
quadratic variation and the square Laplacian, corresponding to the cases $\alpha = 1$, $\beta = 0$ and $\alpha = 0$, $\beta = 1$
respectively, form a basis. ∎
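The conclusion of Proposition 6 can be checked numerically. The sketch below is illustrative and rests on stated assumptions: the 4 by 4 matrix is parameterized by $\alpha$, $\beta$, $\gamma$, $\delta$ as in the form derived in the proof, a rotation of the frame is taken to replace the matrix D by $K^T D K$ with $K = R^T \otimes R^T$, and the helper names and sample values are hypothetical.

```python
import math

def kron(a, b):
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def transpose(a):
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def symmetric_form(alpha, beta, gamma, delta):
    """4x4 matrix of a rotationally symmetric quadratic form (assumed parameterization)."""
    return [[alpha + beta, gamma, -gamma, beta],
            [gamma, alpha - delta, delta, gamma],
            [-gamma, delta, alpha - delta, -gamma],
            [beta, gamma, -gamma, alpha + beta]]

def in_rotated_frame(d, phi):
    """Matrix of the same quadratic form after the frame is rotated through phi."""
    c, s = math.cos(phi), math.sin(phi)
    k = kron([[c, -s], [s, c]], [[c, -s], [s, c]])   # R^T kron R^T
    return matmul(transpose(k), matmul(d, k))

def max_abs_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

alpha, beta = 1.5, -0.4
D = symmetric_form(alpha, beta, 0.3, 0.8)
drift = max(max_abs_difference(in_rotated_frame(D, phi), D) for phi in (0.4, 1.3, 2.9))

# With f_xy == f_yx the form reduces to alpha * (quadratic variation)
# plus beta * (square Laplacian).
fxx, fxy, fyy = 0.7, -1.2, 2.1
v = [fxx, fxy, fxy, fyy]
direct = sum(v[i] * D[i][j] * v[j] for i in range(4) for j in range(4))
from_basis = alpha * (fxx ** 2 + 2 * fxy ** 2 + fyy ** 2) + beta * (fxx + fyy) ** 2
```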
We show that the measure of smoothness of optical flow proposed by Horn and Schunck(1981) is
rotationally symmetric. Recall from section 2 that the measure is defined by the operator

\[ S(u, v) = (u_x^2 + u_y^2) + (v_x^2 + v_y^2). \]

We extend the Kronecker product operator $\otimes$ to vectors, and then show how to define S(u, v) in terms
of vector Kronecker products.

Definition 3. (a) Let $a = [a_1 \ldots a_m]$ and $b = [b_1 \ldots b_n]$ be vectors. The Kronecker product of a and b is the
mn dimensional vector $[a_1 b_1 \ldots a_1 b_n \ a_2 b_1 \ldots a_m b_n]$.

(b) By extension, if $O = [O_1 \ldots O_m]$ is a vector of operators and $f = [f_1 \ldots f_n]$ is a vector of functions, the
Kronecker product of O and f is the mn dimensional vector of functions

\[ [O_1(f_1) \ldots O_1(f_n) \ldots O_m(f_n)]. \]

The components u and v of optical flow are functions of x, y, and t. Recall that $\nabla_{(x,y)} = [\partial/\partial x \ \ \partial/\partial y]^T$.
According to definition 3,

\[ \nabla_{(x,y)} \otimes [u \ \ v]^T = \left[\frac{\partial u}{\partial x} \ \ \frac{\partial v}{\partial x} \ \ \frac{\partial u}{\partial y} \ \ \frac{\partial v}{\partial y}\right]^T, \]

so that

\[ S(u, v) = (\nabla_{(x,y)} \otimes [u \ \ v]^T) \cdot (\nabla_{(x,y)} \otimes [u \ \ v]^T). \]

If the coordinate frame is rotated through $\phi$ by the matrix R, the optical flow components become $R[u \ \ v]^T$.
The Horn-Schunck measure is rotationally symmetric if and only if

\[ (R \otimes R)^T (R \otimes R) = I_4, \]

where $I_4$ is the 4 by 4 identity matrix. The rotational symmetry is a simple consequence of Proposition 3.
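The orthogonality condition above can be confirmed numerically. The sketch below builds $R \otimes R$ for one sample angle, checks that $(R \otimes R)^T (R \otimes R)$ is the identity, and checks that the sum of squares of a vector $[u_x, v_x, u_y, v_y]$ is unchanged when it is multiplied by $R \otimes R$. The derivative values are arbitrary illustrative numbers, not data from Horn and Schunck.

```python
import math

def kron(a, b):
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def transpose(a):
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def mat_vec(m, v):
    return [sum(row[j] * v[j] for j in range(len(v))) for row in m]

phi = 1.2
c, s = math.cos(phi), math.sin(phi)
R = [[c, s], [-s, c]]
K = kron(R, R)
gram = matmul(transpose(K), K)          # should be the 4x4 identity
identity_error = max(abs(gram[i][j] - (1.0 if i == j else 0.0))
                     for i in range(4) for j in range(4))

derivatives = [0.3, -1.1, 2.0, 0.7]     # [u_x, v_x, u_y, v_y], arbitrary values
S_before = sum(d * d for d in derivatives)
S_after = sum(d * d for d in mat_vec(K, derivatives))
```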

A rotationally symmetric operator has the general form

\[ O(\nabla, \ \nabla \otimes \nabla, \ \nabla \otimes \nabla \otimes \nabla, \ \ldots), \]

and its application to a rotationally symmetric function f(x, y) has the form

\[ O_{(x,y)}(f(x, y)). \]

To see that this is rotationally symmetric, we rotate the coordinate frame to (X, Y) by a matrix R as before.
Since O and f are rotationally symmetric, all the occurrences of R (including its Kronecker square, cube, and so
on) introduced by the frame change can be deleted. It follows that the application of a rotationally symmetric
operator to a rotationally symmetric function is itself rotationally symmetric. In particular, the $\Delta G$ filters of
the Marr-Hildreth theory of edge detection are rotationally symmetric.

4. Vision as a conservative process 

The second theme of this paper is that a number of vision modules construct the most conservative
interpretation that is consistent with the given data, and that is subject to an appropriate set of suitably formulated
constraints. A major concern of Computer Vision has always been the isolation of constraints that enable the
interpretation of an image. Constraints embody observations about the way the world is, at least most of the
time. Although such observations can be as specific as cataloging familiar figures and shapes, it has proved
more fruitful to first uncover constraints that correspond to general observations that are widely applicable.
Constraints are used together with the data computed from the image to construct an interpretation. The
representations of the information from the image and the constraints determine, and are determined by,
the interpretation process. For example, early blocks world programs represented constraints as catalogs of
labellings, an approach that led naturally to search processes for interpretation (Clowes 1971, Kanade 1981).




As Computer Vision has turned its attention to images of the natural world, constraints have concerned 
the smoothness of surfaces and movement. The relationship to boundary value problems of physics and 
mathematics suggests itself. The information computed from the image sets the boundary conditions, and the 
constraints determine which (and whether a) solution to the boundary value problem is found. Horn(1974)
solved an instance of Poisson's problem using Green's functions to determine the lightness of an image.

Following a different approach, Ullman(1979a) studied the perception of apparent motion generated 
by two successive frames consisting of isolated dots of equal intensity moving independently of each other. 
Without constraint, all possible pairings, or "correspondences", of dots in the first frame with dots in the 
second are equally likely. Ullman defined the "most likely" correspondence to be the one that minimized the 
sum 


l<i<n 

1<?<™ 

where n is the number of dots in the first frame, m is the number of dots in the second frame, and x\j is one if 
the ith dot of the first frame P t is paired with the jth dot of the second frame Qj, else zero. The weight qij is 
the "cost" of pairing P, with Qj, and might, for example, be related to the image distance between the paired 
points. The problem of finding the minimal correspondence is considered in terms of integer programming. If 
correspondences are assumed to be covering mappings, the following linear constraints apply to the Xif 

$$\forall i,\ 1 \le i \le n: \quad \sum_{1 \le j \le m} x_{ij} \ge 1,$$

and

$$\forall j,\ 1 \le j \le m: \quad \sum_{1 \le i \le n} x_{ij} \ge 1.$$

Ullman restricted the set of $Q_j$ that can be paired with $P_i$ to those whose positions were close to $P_i$. Following
Arrow, Hurwicz, and Uzawa (1958), he set up an iterative scheme.



The approach can be extended to mappings that are not one-one, as well as to continuous motion. A major
problem with the approach is guaranteeing the convergence of the algorithm. This is determined largely by the
properties of the costs $q_{ij}$, but this was not investigated, aside from a comment on the empirical determination
of the $q_{ij}$ (see also Ullman 1979b).
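The minimization over covering mappings can be made concrete with a small sketch. The code below is not Ullman's iterative scheme; it is an exhaustive search over all 0/1 assignments, feasible only for a handful of dots, and the cost matrix `q` is a hypothetical example chosen for illustration.

```python
from itertools import product


def minimal_cover(q):
    """Exhaustively find the 0/1 matrix x that minimizes sum q[i][j] * x[i][j]
    subject to the covering constraints: every row i and every column j
    contains at least one entry equal to one."""
    n, m = len(q), len(q[0])
    best_cost, best_x = float("inf"), None
    for bits in product((0, 1), repeat=n * m):
        x = [bits[i * m:(i + 1) * m] for i in range(n)]
        rows_ok = all(sum(row) >= 1 for row in x)
        cols_ok = all(sum(x[i][j] for i in range(n)) >= 1 for j in range(m))
        if not (rows_ok and cols_ok):
            continue
        cost = sum(q[i][j] * x[i][j] for i in range(n) for j in range(m))
        if cost < best_cost:
            best_cost, best_x = cost, [list(row) for row in x]
    return best_cost, best_x


# Hypothetical costs (e.g. image distances) between 2 dots in the first
# frame and 3 dots in the second frame.
q = [[1.0, 4.0, 5.0],
     [4.0, 1.0, 2.0]]
cost, x = minimal_cover(q)
print(cost, x)
```

Because the search is exponential in n·m, it serves only to make the objective and the two families of constraints explicit; any practical method would need the kind of iterative or linear programming scheme discussed in the text.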

One limitation of Ullman's approach is that it is restricted to minimizing a known linear objective function
that is subject to linear constraints. The method can be extended to constrained nonlinear programming,
in which the goal is to minimize a known function f(x) subject to a set of equality and inequality constraints
of the form $g_j(x) \le 0$. In general, however, criteria based on something other than intuition need to be found for
selecting the function f to be minimized. To do this, one can apply the calculus of variations (see for example
Courant and Hilbert 1953, chapter IV). The familiar differential calculus shows how to find a real-valued parameter
that minimizes some function. The calculus of variations extends the differential calculus by showing how one
can determine a function f, which is subject to a given set of boundary conditions, and minimizes the integral


$$\Theta(f) = \iint F(x, y, f, f_x, f_y, f_{xx}, f_{xy}, f_{yy})\, dx\, dy \qquad (17)$$

over a given region of integration G. The function F is called a "performance index" and generalizes the
notion of cost function associated with linear and nonlinear programming. In the next section we shall consider
the choice of a performance index for interpolating smooth surfaces from one-dimensional boundary
conditions.

† For simplicity of presentation, we restrict attention to functions f of one or two variables x, y.





Associated with a variational problem of the form (17) is the Euler equation, which provides a necessary,
though by no means sufficient, condition which a function f must satisfy if it is to minimize the variational
integral $\Theta(f)$. For the particular variational problem given in equation (17), the Euler equation is


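$$F_f - \frac{\partial}{\partial x} F_{f_x} - \frac{\partial}{\partial y} F_{f_y} + \frac{\partial^2}{\partial x^2} F_{f_{xx}} + \frac{\partial^2}{\partial x\, \partial y} F_{f_{xy}} + \frac{\partial^2}{\partial y^2} F_{f_{yy}} = 0. \qquad (18)$$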


In the case that there is only a single independent variable x, the partial derivatives are total and the Euler
equation becomes

$$F_f - \frac{d}{dx} F_{f_x} + \frac{d^2}{dx^2} F_{f_{xx}} = 0. \qquad (19)$$
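As a brief worked illustration (ours, added for concreteness rather than taken from the works cited), consider the one-variable performance index $F = f_{xx}^2$, which is the small angle index discussed later in this section. Since this F depends on neither f nor $f_x$, only the last term of the one-variable Euler equation survives:

```latex
\begin{aligned}
F_f &= 0, \qquad F_{f_x} = 0, \qquad F_{f_{xx}} = 2 f_{xx}, \\
\frac{d^2}{dx^2}\bigl(2 f_{xx}\bigr) &= 2 f_{xxxx} = 0
\quad\Longrightarrow\quad
f(x) = a + b x + c x^2 + d x^3 .
\end{aligned}
```

The four constants a, b, c, d are fixed by the boundary conditions, for example the position and slope at each of the two end points.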

There are two important considerations associated with the use of the calculus of variations. First, unlike 
the differential calculus, the existence of an extremum $f^*$ of the integral given in equation (17) cannot be taken
for granted. Courant and Hilbert(1953, p. 173) note that "a characteristic difficulty of the calculus of variations 
is that problems which can be meaningfully formulated may not have solutions". Conditions for the existence 
of a minimum have recently been proposed by Grimson(1981) and will be discussed in the next section. 

Second, associated with any variational problem is a set of natural boundary conditions which imposes a 
necessary condition on any feasible solution to the Euler equation at the boundary. Courant and Hilbert(1953, 
p. 211) note that "in general, we can, by adding boundary terms or boundary integrals essentially modify 
the natural boundary conditions without altering the Euler equations". Determining the "most conservative" 
solution means finding a performance index that guarantees the existence of an extremum function f* and 
provides the tightest set of natural boundary conditions that are consistent with the given data. 

The calculus of variations has recently been applied by a number of authors to interpolate plane and 
space curves and surfaces. We review the applications in that order. First, Horn(1981) has recently determined 
the curve which passes through two specified points with specified orientation while minimizing

$$\int \kappa^2\, ds, \qquad (20)$$

where $\kappa$ is the curvature and s is the arc length. This is the true shape of a spline used in "lofting" (Faux and
Pratt 1979, p. 228). In a thin beam, curvature is proportional to the bending moment. The total elastic energy
stored in a thin beam is therefore proportional to the integral of the square of the curvature. Since the shape
taken on by a thin beam is the one which minimizes the internal strain energy, the curve that minimizes the integral
(20) is called the "curve of least energy". The variational problem is to minimize

$$\int \frac{f_{xx}^2}{(1 + f_x^2)^{5/2}}\, dx.$$

This has the form of equation (17). Horn (1981, page 19) shows that the Euler equation is

$$c\kappa = \sqrt{\cos\psi},$$

where $\psi$ is the angle between the tangent to the curve and the axis of symmetry. The solution to this
differential equation is an incomplete elliptic integral of the first kind. Brady, Grimson, and Langridge (1980)
consider a "small angle" approximation to the curve of least energy, in which first derivatives can be ignored.
The performance index that they use is $f_{xx}^2$, for reasons that will become evident in the next section. They find
that in that case the solution is a cubic. Horn (1981, page 2) notes that the fact that a curve has near minimum
energy does not mean that it lies close to the curve of minimum energy. Note that the existence of the curve
of least energy is guaranteed as Horn has derived an analytical formula for it. Approximations to it, such as
Brady, Grimson, and Langridge's, are similarly guaranteed to exist.
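The claim that the small angle solution is a cubic can be checked numerically. The sketch below is an illustration under our own assumptions, not the procedure of Brady, Grimson, and Langridge: it builds the cubic on [0, 1] that matches given end positions and slopes, and compares its value of the integral of $f_{xx}^2$ with that of perturbed curves satisfying the same boundary data. The function names and the perturbation $x^2(1-x)^2$ are hypothetical choices.

```python
def hermite_cubic(y0, s0, y1, s1):
    """Coefficients (a, b, c, d) of f(x) = a + b x + c x^2 + d x^3 on [0, 1]
    with f(0) = y0, f'(0) = s0, f(1) = y1, f'(1) = s1."""
    a, b = y0, s0
    c = 3 * (y1 - y0) - 2 * s0 - s1
    d = -2 * (y1 - y0) + s0 + s1
    return a, b, c, d


def bending_energy(c, d, eps, steps=200):
    """Composite Simpson estimate of the integral over [0, 1] of
    (f'' + eps * p'')^2, where f'' = 2c + 6dx is the second derivative of
    the cubic and p(x) = x^2 (1 - x)^2 is a perturbation that leaves the
    end positions and end slopes unchanged. `steps` must be even."""
    def integrand(x):
        second = 2 * c + 6 * d * x + eps * (2 - 12 * x + 12 * x * x)
        return second * second

    h = 1.0 / steps
    total = integrand(0.0) + integrand(1.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(k * h)
    return total * h / 3


# Hypothetical boundary data: f(0) = 0, f'(0) = 1, f(1) = 0, f'(1) = -1.
_, _, c, d = hermite_cubic(0.0, 1.0, 0.0, -1.0)
for eps in (0.0, 0.5, -0.5):
    print(eps, bending_energy(c, d, eps))
```

Any non-zero perturbation of this kind raises the energy, which is consistent with the cubic being the minimizer among curves that share its end positions and slopes.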

Barrow and Tenenbaum(1981) investigate the problem of interpreting a line as the image of a space curve 
that is an occluding boundary. They observe that the problem has two parts: (i) determining the tangent 
vector t at each point on the space curve, and (ii) determining the surface normal at each point, given that it is 
constrained to be orthogonal to the tangent. 






They suggest minimizing a performance index F that is a function of the curvature $\kappa$ and the torsion $\tau$
(possibly together with their derivatives), and expresses a suitable notion of "smoothness". They first consider
uniformity of curvature as a measure of smoothness, that is $F = (d\kappa/ds)^2$, where s measures distance along
the space curve. They reject this measure on the grounds that it can be made arbitrarily small by "stretching
out the space curve so that it approaches a twisting straight line". To overcome this difficulty, they propose
that the space curve should also be "as planar as possible or, more precisely, that the integral of its torsion
should be minimized".

Barrow and Tenenbaum finally suggest finding the space curve that projects to the given image line and
minimizes the performance index $[d(\kappa\mathbf{b})/ds]^2$, where $\mathbf{b}$ is the binormal. They report that an algorithm based on
their analysis produced the "correct 3-D interpretations for simple and closed curves, such as an ellipse, which
was interpreted as a circle". However, they note that the rate of convergence was slow and dependent on the
initial data. No consideration is given to the Euler equations, to the existence of an extremum given a line
drawing (x(s), y(s)), or to the natural boundary conditions associated with the performance index $[d(\kappa\mathbf{b})/ds]^2$.
Empirical evidence that the method works on a number of simple test cases is encouraging, but there is no
analysis of the scope of the method.

In the same paper, Barrow and Tenenbaum(1981) consider the interpolation of a smooth surface from 
depth and local surface orientation values at all points along the surface boundary. Their approach is to 
"seek a technique that yields exact reconstructions for the special symmetric cases of spherical and cylindrical 
surfaces, as well as intuitively reasonable reconstructions for other smooth surfaces." (Barrow and Tenenbaum 
1981). They observe that if n is the surface normal of a cylinder, then the x and y components of the normal,
$n_x$ and $n_y$, are linear functions of x and y, so long as the axis of the cylinder lies in the x-y plane. This
observation forms the basis of an algorithm to estimate the surface normal by least squares fitting of the
parameters of the partial derivatives of the normal. As before, no analysis is given of the Euler equation, the
natural boundary conditions, or the convergence of their algorithm for different types of surface.

5. A performance index for surface interpolation

In the review of the application of the calculus of variations to visual perception in the previous section 
we drew attention to three important considerations. First, the Euler equations provide a necessary condition 
on possible extremal functions. Second, the existence of an extremum cannot be taken for granted, even when 
the minimization problem seems plausible on some grounds. Third, the natural boundary conditions impose 
a necessary condition on any feasible solution to the Euler equation at the boundary. The most thorough 
analysis of the second of these problems in Computer Vision, framed in the context of surface interpolation, is
due to Grimson (1981), who proves the following theorem.

Theorem (Grimson, see Rudin (1973)). Suppose there exists a complete semi-norm F on a space of functions
$\mathcal{F}$, and that F satisfies the parallelogram law. Then every non-empty closed convex set $\mathcal{S} \subset \mathcal{F}$ contains a
unique element $f^*$ of minimal norm $F(f^*)$, up to possibly an element of the null space of F.

A semi-norm F is a function $V \to \mathbb{R}^{+}$ from a vector space V to the non-negative real numbers that satisfies

$$F(v + w) \le F(v) + F(w)$$

$$F(av) = |a|\, F(v).$$

Informally, a semi-norm is a generalization of the Euclidean metric, and provides a measure of the size of a vector. The
first condition generalizes the triangle inequality, for example. The null space of the semi-norm F consists
of all those vectors $v_0$ that map to zero. Since

$$F(v + v_0) = F(v),$$

any element of the null space can be added to a vector of minimal norm to yield another vector of minimal
norm. Hence the qualifying phrase "unique ... up to possibly an element of the null space of F". The
parallelogram law states that

$$[F(v + w)]^2 + [F(v - w)]^2 = 2[F(v)]^2 + 2[F(w)]^2,$$




for all vectors v, w. Finally, the semi-norm is complete if all Cauchy sequences converge. As is well known,
the elements of vector spaces can be functions. This enables Grimson to prove the following Corollary, which
guarantees the existence of an extremal function in "most conservative" interpolation problems posed in the
calculus of variations.
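The three defining properties can be spot-checked numerically. The sketch below is only an illustration under simplifying assumptions: it works pointwise on a triple of second derivatives $(f_{xx}, f_{xy}, f_{yy})$ rather than on integrals over a function space, it takes the square root of each quadratic form so that the homogeneity condition has degree one, and random testing is of course evidence rather than proof. All function names are hypothetical.

```python
import math
import random


def sqrt_quadratic_variation(h):
    """Square root of fxx^2 + 2 fxy^2 + fyy^2 for h = (fxx, fxy, fyy)."""
    fxx, fxy, fyy = h
    return math.sqrt(fxx * fxx + 2 * fxy * fxy + fyy * fyy)


def sqrt_square_laplacian(h):
    """Square root of (fxx + fyy)^2, that is |fxx + fyy|."""
    fxx, _, fyy = h
    return abs(fxx + fyy)


def check_semi_norm(F, trials=1000, tol=1e-9, seed=0):
    """Randomly test the triangle inequality, homogeneity F(av) = |a| F(v),
    and the parallelogram law for F on vectors of length three."""
    rng = random.Random(seed)
    for _ in range(trials):
        v = [rng.uniform(-5, 5) for _ in range(3)]
        w = [rng.uniform(-5, 5) for _ in range(3)]
        a = rng.uniform(-5, 5)
        plus = [p + q for p, q in zip(v, w)]
        minus = [p - q for p, q in zip(v, w)]
        if F(plus) > F(v) + F(w) + tol:
            return False
        if abs(F([a * p for p in v]) - abs(a) * F(v)) > tol:
            return False
        lhs = F(plus) ** 2 + F(minus) ** 2
        rhs = 2 * F(v) ** 2 + 2 * F(w) ** 2
        if abs(lhs - rhs) > tol * max(1.0, rhs):
            return False
    return True


print(check_semi_norm(sqrt_quadratic_variation))
print(check_semi_norm(sqrt_square_laplacian))
# A non-zero vector in the null space of the second form but not the first.
print(sqrt_square_laplacian((1.0, 3.0, -1.0)),
      sqrt_quadratic_variation((1.0, 3.0, -1.0)))
```

The last line illustrates the role of the null space: the triple (1, 3, -1) is non-zero yet is assigned zero by the Laplacian-based form, so a minimizer under that form is determined only up to such elements.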

Corollary (Grimson 1981). Let the set of known points be $\{(x_i, y_i, z_i) \mid 1 \le i \le n\}$. Let $\mathcal{F}$ be a vector
space of possible functions on $\mathbb{R}^2$ and let $\mathcal{S}$ be the subset of $\mathcal{F}$ that interpolates the known data. That is, for all
functions $f \in \mathcal{S}$, $f(x_i, y_i) = z_i$. Let F be a complete semi-norm on $\mathcal{F}$ that satisfies the parallelogram law. Then
there exists a unique (up to the null space of F) function $f^*$ that interpolates the data and has minimal norm.
In particular, if F is a performance index then there is a function $f^*$ that minimizes the integral

$$\Theta(f) = \iint F\, dx\, dy.$$

In short, if the conditions of the Corollary are fulfilled, the existence of a "most conservative" surface that 
meets the boundary conditions is guaranteed. As we shall see, the condition of being a semi-norm is the most 
restrictive required of the performance index. The conditions are sufficient to guarantee the existence of a 
minimum, but they are not necessary. For example, $\kappa^2$ is not a semi-norm†; nevertheless Horn's (1981) analysis
shows that there is a unique minimum. It is far from clear whether Barrow and Tenenbaum's (1981) analyses of
curve and surface interpolation have a guaranteed minimum in all cases.

Grimson notes that several intuitively plausible performance indices are not semi-norms. For example,
the two most popular measures of curvature are not. Suppose that $\kappa_1$ and $\kappa_2$ are the principal curvatures of
a surface (Faux and Pratt 1979, p. 111); then the Gaussian curvature $\kappa_g$ is the product $\kappa_1\kappa_2$ and the mean
curvature $\kappa_m$ is the sum $\kappa_1 + \kappa_2$. For a surface f(x, y),

$$\kappa_g(f) = \frac{f_{xx} f_{yy} - f_{xy}^2}{(1 + f_x^2 + f_y^2)^2}.$$

† Which is why Brady, Grimson, and Langridge (1980) used the small angle approximation $f_{xx}^2$.







Since the curvatures can be negative, while a semi-norm is required to be non-negative, it is necessary to
investigate

$$\iint \kappa_g^2\, dx\, dy.$$

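Grimson (1981) observes that $\kappa_g^2(af) \ne |a|\, \kappa_g^2(f)$ because of the denominator. If $f_x$ and $f_y$ are small, the
denominator is approximately equal to one, and the numerator is a semi-norm. Note that the resulting
approximation to the Gaussian curvature is

$$f_{xx} f_{yy} - f_{xy}^2. \qquad (21)$$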
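The effect of the denominator can be seen in a small numerical sketch. The example surface $f = (x^2 + y^2)/2$, the evaluation point (1, 0), and the scale factor a = 2 are our own hypothetical choices; the derivative values are entered by hand rather than computed.

```python
def gaussian_curvature(fx, fy, fxx, fxy, fyy):
    """Gaussian curvature of the surface z = f(x, y) from its derivatives."""
    return (fxx * fyy - fxy ** 2) / (1 + fx ** 2 + fy ** 2) ** 2


def small_angle_gaussian(fx, fy, fxx, fxy, fyy):
    """Small angle approximation: the denominator is replaced by one."""
    return fxx * fyy - fxy ** 2


# Derivatives of f = (x^2 + y^2) / 2 at the point (1, 0).
derivs = dict(fx=1.0, fy=0.0, fxx=1.0, fxy=0.0, fyy=1.0)
a = 2.0
# Scaling f by a scales every derivative by a.
scaled = {name: a * value for name, value in derivs.items()}

ratio_exact = gaussian_curvature(**scaled) / gaussian_curvature(**derivs)
ratio_small = small_angle_gaussian(**scaled) / small_angle_gaussian(**derivs)
print(ratio_exact, ratio_small)
```

In this example the exact curvature changes by a factor that is neither |a| nor a², because the denominator also depends on the scale of f, whereas the small angle numerator scales exactly as a², as a quadratic form in the second derivatives must.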







Grimson shows that the mean curvature $\kappa_m$ is also not a semi-norm for exactly the same reason. The
analogous small angle approximation is

$$(f_{xx} + f_{yy})^2 = (\Delta f)^2,$$

the square Laplacian, which is a semi-norm. We find it convenient to denote the square Laplacian by $F_l$.
Grimson (1981) chooses the quadratic variation

$$f_{xx}^2 + 2 f_{xy}^2 + f_{yy}^2,$$

on the grounds that its null space, consisting of all linear functions, is smaller than the null space of the square
Laplacian. If we denote the quadratic variation by $F_q$, we see that the approximation to the Gaussian curvature
given in equation (21) is

$$\tfrac{1}{2}(F_l - F_q).$$
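The relationship between the three candidates, and the difference in their null spaces, can be checked pointwise. In the sketch below the function names are hypothetical and the second derivatives of the example surfaces are entered by hand.

```python
def square_laplacian(fxx, fxy, fyy):
    """Pointwise value of (fxx + fyy)^2."""
    return (fxx + fyy) ** 2


def quadratic_variation(fxx, fxy, fyy):
    """Pointwise value of fxx^2 + 2 fxy^2 + fyy^2."""
    return fxx ** 2 + 2 * fxy ** 2 + fyy ** 2


def small_angle_gaussian(fxx, fxy, fyy):
    """Pointwise value of fxx * fyy - fxy^2."""
    return fxx * fyy - fxy ** 2


# Second derivatives (fxx, fxy, fyy) of some example surfaces.
examples = {
    "linear: 2x + 3y": (0.0, 0.0, 0.0),
    "saddle: xy": (0.0, 1.0, 0.0),
    "saddle: x^2 - y^2": (2.0, 0.0, -2.0),
    "bowl: x^2 + y^2": (2.0, 0.0, 2.0),
}
for name, h in examples.items():
    fl, fq = square_laplacian(*h), quadratic_variation(*h)
    # Half the difference of the two forms equals fxx * fyy - fxy^2.
    assert (fl - fq) / 2 == small_angle_gaussian(*h)
    print(name, fl, fq)
```

The two saddles are harmonic, so the square Laplacian assigns them zero, while the quadratic variation does not; of these examples only the linear function lies in the null space of both.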

How shall we choose a performance index for surface interpolation, given that it has to satisfy the conditions
of the Corollary? We have exhibited three candidates; are there more? Notice first that each of the
semi-norms given above is a quadratic form in $f_{xx}$, $f_{xy}$, and $f_{yy}$. It is easy to show that any quadratic form
satisfies the semi-norm and parallelogram conditions, and so there is an infinite set of plausible semi-norms to
use to find the "most conservative" interpolated surface. We need an extra condition, and the one we choose
is rotational symmetry, since we suppose that surface interpolation is an isotropic process. Proposition 6 of
section 3 shows that the rotationally symmetric quadratic forms in $f_{xx}$, $f_{xy}$, and $f_{yy}$ form a vector space that
has the square Laplacian and the quadratic variation as a basis. The choice of which performance index to use
is thus effectively reduced to the square Laplacian, the quadratic variation, and linear combinations of them.
How shall we choose between those two? In the light of our earlier discussion, two criteria suggest themselves:
the Euler equations and the natural boundary conditions.

Proposition 7. All rotationally symmetric quadratic forms lead to an identical Euler equation

$$\Delta^2 f = 0.$$

Proof. We exploit the fact that the square Laplacian and the quadratic variation are a basis of the
rotationally symmetric quadratic forms.

a. Square Laplacian: The performance index is

$$F_l = (f_{xx} + f_{yy})^2.$$

By equation (18) the Euler equation is

$$\frac{\partial^2}{\partial x^2}\{2(f_{xx} + f_{yy})\} + \frac{\partial^2}{\partial y^2}\{2(f_{xx} + f_{yy})\} = 0,$$

that is

$$\Delta^2 f = 0,$$

as required.

b. Quadratic variation: The Euler equation is

$$2 f_{xxxx} + 4 f_{xyxy} + 2 f_{yyyy} = 0,$$

that is

$$\Delta^2 f = 0,$$

provided that f has continuous partial derivatives of fourth order.

c. Linear combinations of $F_l$ and $F_q$: Linear combinations clearly give rise to the identical Euler equation.

The gist of Proposition 7 is that there is no difference between $F_q$ and $F_l$ in the interior, away from the
boundary conditions. We can see the result of Proposition 7 in an alternative interesting way. Recall that

$$\tfrac{1}{2}(F_l - F_q) = f_{xx} f_{yy} - f_{xy}^2$$

is the semi-norm approximation to the Gaussian curvature (equation 21). The latter expression is an instance
of a divergence expression, and Courant and Hilbert (1953, p. 196) note "If the difference between the integrands
of two variational problems is a divergence expression, then the Euler equations and therefore the
families of extremals are identical for the two variational problems."
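One way to see that the small angle approximation to the Gaussian curvature is a divergence expression is to exhibit it as one. The particular vector field used below, $(f_x f_{yy},\ -f_x f_{xy})$, is our own choice for illustration (others work equally well), and f is assumed smooth enough for mixed partial derivatives to commute:

```latex
\begin{aligned}
\frac{\partial}{\partial x}\bigl(f_x f_{yy}\bigr) - \frac{\partial}{\partial y}\bigl(f_x f_{xy}\bigr)
  &= f_{xx} f_{yy} + f_x f_{xyy} - f_{xy}^2 - f_x f_{xyy} \\
  &= f_{xx} f_{yy} - f_{xy}^2 .
\end{aligned}
```

By the divergence theorem, the integral of this expression over a region depends only on values along the boundary, so adding a multiple of it to a performance index leaves the Euler equation unchanged and can affect only the natural boundary conditions.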

Since $F_q$ and $F_l$ have identical Euler equations, we analyze their natural boundary conditions in order
to choose between them. We could approach this problem directly, but a more revealing route is available.
Courant and Hilbert (1953, p. 250) consider the statics of a thin plate. In particular they determine the shape it
assumes for a given force p(s) along its boundary $\Gamma$ and bending moments m(s) normal to its boundary.

Courant and Hilbert note that the energy stored in the plate is the integral of a quadratic form in the
principal curvatures $\kappa_1$ and $\kappa_2$ of the surface, a result which can be derived from noting that the elastic energy
stored in a thin strip (corresponding to any normal section) is proportional to the square curvature. It follows
that the stored energy is locally








$$\begin{aligned}
E &= \alpha(\kappa_1^2 + \kappa_2^2) + 2\beta\kappa_1\kappa_2 \\
  &= \alpha(\kappa_1 + \kappa_2)^2 + 2(\beta - \alpha)\kappa_1\kappa_2 \\
  &= \alpha\kappa_m^2 + 2(\beta - \alpha)\kappa_g \\
  &\approx \alpha F_l + 2(\beta - \alpha)\frac{F_l - F_q}{2} \\
  &= \beta F_l + (\alpha - \beta) F_q \\
  &= \alpha(\mu F_l + (1 - \mu) F_q),
\end{aligned}$$

where $\mu = \beta/\alpha$. It follows that the energy stored in the thin plate is a convex linear combination of the
square Laplacian and the quadratic variation, which formally establishes its connection to the visual perceptual
problem studied here. Observe that setting the weight $\mu = 1$ gives the square Laplacian, while setting it
equal to zero gives the quadratic variation. Note also that this expression for the stored energy makes use of
the small angle approximation to the curvature used in equation (21).

A second source of stored energy derives from the boundary conditions, which are represented as a function
p(s) along the boundary $\Gamma$ of the plate and a bending moment m(s) applied normal to the plate. Courant and
Hilbert (1953, p. 251) show that the natural boundary conditions associated with the plate are

$$-\Delta f + (1 - \mu)\bigl(f_{xx} x_s^2 + 2 f_{xy} x_s y_s + f_{yy} y_s^2\bigr) = p(s)$$

$$\frac{\partial}{\partial n}\Delta f + (1 - \mu)\frac{\partial}{\partial s}\bigl(f_{xx} x_s x_n + f_{xy}(x_s y_n + x_n y_s) + f_{yy} y_s y_n\bigr) = m(s),$$

that is

$$-\Delta f + (1 - \mu)\bigl([x_s\ y_s]\, H\, [x_s\ y_s]^T\bigr) = p(s)$$

$$\frac{\partial}{\partial n}\Delta f + (1 - \mu)\frac{\partial}{\partial s}\bigl([x_n\ y_n]\, H\, [x_s\ y_s]^T\bigr) = m(s),$$

where H is the Hessian matrix

$$H = \begin{bmatrix} f_{xx} & f_{xy} \\ f_{xy} & f_{yy} \end{bmatrix}.$$

Gladwell and Wait (1979) quote a version of this result due to Agmon (1965), that the biharmonic operator,
which we showed gives the Euler equation for the surface interpolation problem, has Dirichlet
forms that are linear combinations of the square Laplacian and the quadratic variation. As an example of the
constraint, consider a straight line contour aligned with the x-axis. Then $[x_s\ y_s] = [1\ 0]$ and $[x_n\ y_n] = [0\ 1]$.
The natural boundary conditions reduce to

$$-(f_{yy} + \mu f_{xx}) = p(s)$$

$$f_{yyy} + (2 - \mu) f_{yxx} = m(s).$$

The constraint is tightest when $\mu$ is not equal to one. A similar result can be obtained for a straight line
contour inclined at an angle $\alpha$ to the x-axis. The left-hand side of the first of the natural boundary conditions is

$$-f_{xx}(\sin^2\alpha + \mu\cos^2\alpha) - f_{yy}(\cos^2\alpha + \mu\sin^2\alpha) + (1 - \mu)\sin 2\alpha\, f_{xy}.$$

If $\mu = 1$, there is no constraint from the cross derivative. If $\mu$ is not equal to 1, at most one of the
terms can be zero. We conclude that for interpolation problems in which the small angle approximations used
throughout our analysis hold, it is preferable to choose $\mu$ not equal to one, that is to say, not to use the square
Laplacian as a performance index. The quadratic variation is an obvious choice, but so are linear combinations
of the square Laplacian and the quadratic variation for which $\mu$ is not equal to one. Grimson (1981) chooses
the quadratic variation since its null space is smaller than that of the square Laplacian. This is a precise way
of saying that it imposes a tighter constraint. For example, the function f(x, y) = xy is in the null space of
the square Laplacian but not in the null space of the quadratic variation. Since the quadratic variation has
the smallest null space among the linear combinations of the square Laplacian and quadratic variation, this
is an additional reason for choosing it. We would further expect that any differences between the quadratic
variation and the square Laplacian would show up near the given boundary data but not in the interior, far
removed from the boundary. This is what Grimson (1981) finds in a set of examples that compare surfaces
interpolated using the quadratic variation and the square Laplacian.
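The dependence of the boundary constraint on $\mu$ can be illustrated with a short sketch that evaluates the left-hand side of the first natural boundary condition in its matrix form, $-\Delta f + (1 - \mu)[x_s\ y_s] H [x_s\ y_s]^T$, for a straight contour at angle $\alpha$. The function names and the sample Hessian values are hypothetical, and the unit tangent is assumed to be $(\cos\alpha, \sin\alpha)$.

```python
import math


def first_boundary_lhs(fxx, fxy, fyy, mu, alpha):
    """Left-hand side -lap(f) + (1 - mu) * [xs ys] H [xs ys]^T of the first
    natural boundary condition, for a straight contour whose unit tangent
    is (cos(alpha), sin(alpha))."""
    xs, ys = math.cos(alpha), math.sin(alpha)
    tangential = fxx * xs * xs + 2 * fxy * xs * ys + fyy * ys * ys
    return -(fxx + fyy) + (1 - mu) * tangential


def cross_derivative_weight(mu, alpha):
    """Coefficient multiplying fxy in the expression above."""
    return (1 - mu) * math.sin(2 * alpha)


alpha = math.pi / 4
for mu in (1.0, 0.5, 0.0):
    # Changing fxy alters the boundary expression only when mu != 1.
    low = first_boundary_lhs(1.0, 0.0, 2.0, mu, alpha)
    high = first_boundary_lhs(1.0, 5.0, 2.0, mu, alpha)
    print(mu, cross_derivative_weight(mu, alpha), high - low)
```

With $\mu = 1$ (the square Laplacian) the cross derivative drops out of the condition entirely, while with $\mu = 0$ (the quadratic variation) it enters with full weight, which is one concrete sense in which the quadratic variation constrains the surface more tightly at the boundary.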